A Parallel Implementation of the Nonsymmetric QR Algorithm for Distributed Memory Architectures
Abstract
One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR algorithm. Not long ago, this was widely considered to be a hopeless task. Recent efforts have made significant advances, although the methods proposed up to now have suffered from scalability problems. This paper discusses an approach to parallelizing the QR algorithm that greatly improves scalability. A theoretical analysis indicates that the algorithm is ultimately not scalable, but the nonscalability does not become evident until the matrix dimension is enormous. Experiments on the Intel Paragon™ system, the IBM SP2 supercomputer, and the Intel ASCI Option Red Supercomputer are reported.
Related resources
A Distributed Memory Implementation of the Nonsymmetric QR Algorithm
The QR algorithm is the crux of the serial nonsymmetric eigenvalue problem. Recent efforts to parallelize this algorithm have made significant advances towards solving the parallel nonsymmetric eigenvalue problem. Most methods to date suffer a scalability problem. In this talk we discuss an approach for parallelizing QR which overcomes many of the disadvantages to date. We also give insights into...
A Parallel Implementation of the Nonsymmetric QR Algorithm for Distributed Memory
One approach to solving the nonsymmetric eigenvalue problem in parallel is to parallelize the QR algorithm. Not long ago, this was widely considered to be a hopeless task. Recent efforts have led to significant advances, although the methods proposed up to now have suffered from scalability problems. This paper discusses an approach to parallelizing the QR algorithm that greatly improves scalab...
Polynomial Acceleration for Restarted Arnoldi Iteration and its Parallelization
We propose an accelerating method for the restarted Arnoldi iteration to compute a number of eigenvalues of the standard eigenproblem Ax = λx and discuss the dependence of the convergence rate of the accelerated iteration on the distribution of spectrum. The effectiveness of the approach is proved by numerical results. We also propose a new parallelization technique for the nonsymmetric double sh...
LAPACK Working Note #216: A novel parallel QR algorithm for hybrid distributed memory HPC systems
A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed high performance computing (HPC) systems is presented. For this purpose, we introduce the concept of multi-window bulge chain chasing and parallelize aggressive early deflation. The multi-window approach ensures that most computations when chasing chains of bulges are performed ...
A novel parallel QR algorithm for hybrid distributed memory HPC systems
A novel variant of the parallel QR algorithm for solving dense nonsymmetric eigenvalue problems on hybrid distributed high performance computing (HPC) systems is presented. For this purpose, we introduce the concept of multi-window bulge chain chasing and parallelize aggressive early deflation. The multi-window approach ensures that most computations when chasing chains of bulges are performed ...
Journal:
- SIAM J. Scientific Computing
Volume: 24
Publication date: 2002